# High-resolution processing

Webssl Dino7b Full8b 378
A 7-billion-parameter vision Transformer model trained on 8 billion language-unlabeled web images, achieving exceptional visual representation capabilities through self-supervised learning
Image Classification Transformers
W
facebook
68
0
Auramask Ensemble Poprocket
Gpl-3.0
This model uses an improved vnet architecture for image-to-image processing, supporting tasks such as adversarial, aesthetic, and quality enhancement
Image Generation
A
logasja
15
0
Aimv2 3b Patch14 448.apple Pt
AIM-v2 is an image encoder model based on the timm library, with a 3B parameter scale, suitable for image feature extraction tasks.
Image Classification Transformers
A
timm
79
0
Aimv2 3b Patch14 336.apple Pt
AIM-v2 is an image encoder model based on the timm library, suitable for image feature extraction tasks.
Image Classification Transformers
A
timm
35
0
Resnet50x64 Clip Gap.openai
Apache-2.0
CLIP model image encoder based on ResNet50 architecture with 64x width expansion, using Global Average Pooling (GAP) strategy
Image Classification Transformers
R
timm
107
0
Resnet50x16 Clip Gap.openai
Apache-2.0
A ResNet50x16 variant model based on the CLIP framework, focused on image feature extraction
Image Classification Transformers
R
timm
129
0
Coreml DepthPro
DepthPro is a monocular depth estimation model capable of predicting depth from a single image.
3D Vision
C
KeighBee
17
4
Convnextv2 Huge.fcmae
A self-supervised feature representation model based on ConvNeXt-V2, pre-trained using the Fully Convolutional Masked Autoencoder (FCMAE) framework, suitable for image classification and feature extraction tasks.
Image Classification Transformers
C
timm
52
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase